Combined prediction of Tat and Sec signal peptides with hidden Markov models
نویسندگان
چکیده
MOTIVATION Computational prediction of signal peptides is of great importance in computational biology. In addition to the general secretory pathway (Sec), Bacteria, Archaea and chloroplasts possess another major pathway that utilizes the Twin-Arginine translocase (Tat), which recognizes longer and less hydrophobic signal peptides carrying a distinctive pattern of two consecutive Arginines (RR) in the n-region. A major functional differentiation between the Sec and Tat export pathways lies in the fact that the former translocates secreted proteins unfolded through a protein-conducting channel, whereas the latter translocates completely folded proteins using an unknown mechanism. The purpose of this work is to develop a novel method for predicting and discriminating Sec from Tat signal peptides at better accuracy. RESULTS We report the development of a novel method, PRED-TAT, which is capable of discriminating Sec from Tat signal peptides and predicting their cleavage sites. The method is based on Hidden Markov Models and possesses a modular architecture suitable for both Sec and Tat signal peptides. On an independent test set of experimentally verified Tat signal peptides, PRED-TAT clearly outperforms the previously proposed methods TatP and TATFIND, whereas, when evaluated as a Sec signal peptide predictor compares favorably to top-scoring predictors such as SignalP and Phobius. The method is freely available for academic users at http://www.compgen.org/tools/PRED-TAT/.
منابع مشابه
Metrics and Similarity Measures for Hidden Markov Models
Hidden Markov models were introduced in the beginning of the 1970's as a tool in speech recognition. During the last decade they have been found useful in addressing problems in computational biology such as characterising sequence families, gene finding, structure prediction and phylogenetic analysis. In this paper we propose several measures between hidden Markov models. We give an efficient ...
متن کاملMachine learning approaches for the prediction of signal peptides and other protein sorting signals.
Prediction of protein sorting signals from the sequence of amino acids has great importance in the field of proteomics today. Recently, the growth of protein databases, combined with machine learning approaches, such as neural networks and hidden Markov models, have made it possible to achieve a level of reliability where practical use in, for example automatic database annotation is feasible. ...
متن کاملA profile hidden Markov model for signal peptides generated by HMMER
SUMMARY Although the HMMER package is widely used to produce profile hidden Markov models (profile HMMs) for protein domains, it has been difficult to create a profile HMM for signal peptides. Here we describe an approach for building a complex model of eukaryotic signal peptides by the standard HMMER package. Signal peptide prediction with this model gives a 95.6% sensitivity and 95.7% specifi...
متن کاملSignal Peptide Hydrophobicity Modulates Interaction with the Twin-Arginine Translocase
The general secretory pathway (Sec) and twin-arginine translocase (Tat) operate in parallel to export proteins across the cytoplasmic membrane of prokaryotes and the thylakoid membrane of plant chloroplasts. Substrates are targeted to their respective machineries by N-terminal signal peptides that share a tripartite organization; however, Tat signal peptides harbor a conserved and almost invari...
متن کاملCompetition between Sec- and TAT-dependent protein translocation in Escherichia coli.
Recently, a new protein translocation pathway, the twin-arginine translocation (TAT) pathway, has been identified in both bacteria and chloroplasts. To study the possible competition between the TAT- and the well-characterized Sec translocon-dependent pathways in Escherichia coli, we have fused the TorA TAT-targeting signal peptide to the Sec-dependent inner membrane protein leader peptidase (L...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید
ثبت ناماگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید
ورودعنوان ژورنال:
- Bioinformatics
دوره 26 22 شماره
صفحات -
تاریخ انتشار 2010